Can Anonymous Posters on Medical Forums be Reidentified?
نویسندگان
چکیده
BACKGROUND Participants in medical forums often reveal personal health information about themselves in their online postings. To feel comfortable revealing sensitive personal health information, some participants may hide their identity by posting anonymously. They can do this by using fake identities, nicknames, or pseudonyms that cannot readily be traced back to them. However, individual writing styles have unique features and it may be possible to determine the true identity of an anonymous user through author attribution analysis. Although there has been previous work on the authorship attribution problem, there has been a dearth of research on automated authorship attribution on medical forums. The focus of the paper is to demonstrate that character-based author attribution works better than word-based methods in medical forums. OBJECTIVE The goal was to build a system that accurately attributes authorship of messages posted on medical forums. The Authorship Attributor system uses text analysis techniques to crawl medical forums and automatically correlate messages written by the same authors. Authorship Attributor processes unstructured texts regardless of the document type, context, and content. METHODS The messages were labeled by nicknames of the forum participants. We evaluated the system's performance through its accuracy on 6000 messages gathered from 2 medical forums on an in vitro fertilization (IVF) support website. RESULTS Given 2 lists of candidate authors (30 and 50 candidates, respectively), we obtained an F score accuracy in detecting authors of 75% to 80% on messages containing 100 to 150 words on average, and 97.9% on longer messages containing at least 300 words. CONCLUSIONS Authorship can be successfully detected in short free-form messages posted on medical forums. This raises a concern about the meaningfulness of anonymous posting on such medical forums. Authorship attribution tools can be used to warn consumers wishing to post anonymously about the likelihood of their identity being determined.
منابع مشابه
Sources of Information and Behavioral Patterns in Online Health Forums: Observational Study
BACKGROUND Increasing numbers of patients are raising their voice in online forums. This shift is welcome as an act of patient autonomy, reflected in the term "expert patient". At the same time, there is considerable concern that patients can be easily misguided by pseudoscientific research and debate. Little is known about the sources of information used in health-related online forums, how us...
متن کاملSecure and anonymous decentralized Bitcoin mixing
The decentralized digital currency Bitcoin presents an anonymous alternative to the centralized banking system and indeed enjoys widespread and increasing adoption. Recent works, however, show how users can be reidentified and their payments linked based on Bitcoin’s most central element, the blockchain, a public ledger of all transactions. Thus, many regard Bitcoin’s central promise of financi...
متن کاملReal-world experience with colorectal cancer chemotherapies: patient web forum analysis
BACKGROUND In contrast to clinical trials, patient web forums provide a unique opportunity for patients to spontaneously post their experiences and thoughts about diseases and treatments. This study explored the impact of colorectal cancer (CRC) treatments in these forums. METHODS This was a systematic cross-sectional qualitative analysis. Two active CRC web forums were identified based on fo...
متن کاملTorrenting values, feelings, and thoughts—Cyber nursing and virtual self-care in a breast augmentation forum
Earlier research shows that breast augmentation is positively correlated with positive psychological states. The aim of this study was to explore the shared values, feelings, and thoughts within the culture of breast enlargement among women visiting Internet-based forums when considering and/or undergoing esthetic plastic surgery. The study used a netnographic method for gathering and analyzing...
متن کاملInformation Use in Online Civic Discourse: A Study of Health Care Reform Debate
This article reports on a study of civic discourse in online political forums. On March 23, 2010, the Patient Protection and Affordable Care Act was signed into law in the United States after heated debate. Some of the debate took place online, often in political forums. This study describes and analyzes the information used to frame and support participants’ opinions within the online environm...
متن کامل